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PROJECTED REFLECTED GRADIENT METHODS FOR 
MONOTONE VARIATIONAL INEQUALITIES 

YU. MALITSKYt 


Abstract. This paper is concerned with some new projection methods for solving variational 
inequality problems with monotone and Lipschitz-continuous mapping in Hilbert space. First, 
we propose the projected reflected gradient algorithm with a constant stepsize. It is similar to 
the projected gradient method, namely, the method requires only one projection onto the feasible 
set and only one value of the mapping per iteration. This distinguishes our method from most 
other projection-type methods for variational inequalities with monotone mapping. Also we prove 
that it has R-linear rate of convergence under the strong monotonicity assumption. The usual 
drawback of algorithms with constant stepsize is the requirement to know the Lipschitz constant 
of the mapping. To avoid this, we modify our first algorithm so that the algorithm needs at most 
two projections per iteration. In fact, our computational experience shows that such cases with 
two projections are very rare. This scheme, at least theoretically, seems to be very effective. All 
methods are shown to be globally convergent to a solution of the variational inequality. Preliminary 
results from numerical experiments are quite promising. 

Key words, variational inequality, projection method, monotone mapping, extragradient 
method 
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1. Introduction. We consider the classical variational inequality problem 
(VIP) which is to hnd a point x* € C such that 

{F{x*),x — X*) > 0 Vx G C, (1.1) 

where C is a closed convex set in Hilbert space H, (•, •) denotes the inner product 
in H, and F: H ^ H is some mapping. We assume that the following conditions 
hold 

(Cl) The solution set of (1.1), denoted by S, is nonempty. 

(C2) The mapping F is monotone, i.e., 

{F{x)- F{y),x-y)>0 \/x,y e H. 

(C3) The mapping F is Lipschitz-continuous with constant L > 0, i.e., there 
exists L > 0 such that 

\\F{x) - F{y)\\ < L\\x - y\\ Mx,y & H. 

The variational inequality problem is one of the central problems in nonlinear 
analysis (see [2,5,21]). Also monotone operators have turned out to be an important 
tool in the study of various problems arising in the domain of optimization, nonlin¬ 
ear analysis, differential equations and other related fields (see [3,22]). Therefore, 
numerical methods for VIP with monotone operator have been extensively studied 
in the literature, see [5,10] and references therein. In this section we briefly con¬ 
sider the development of projection methods for monotone variational inequality 
that provide weak convergence to a solution of (1.1). 

The simplest iterative procedure is the well-known projected gradient method 

^n-t-l — F(^(^Xji XF i^Xn')') ^ 

where Pc denotes the metric projection onto the set C and A is some positive 
number. In order to converge, however, this method requires the restrictive as¬ 
sumption that F be strongly (or inverse strongly) monotone. The extragradient 
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method proposed by Korpelevich and Antipin [1,11], according to the following 
formula, overcomes this difficulty 

jyn= PciXn — XF{Xn)) 

^^n+1 — Pc(,^n XF ^ 

where A G (0, ■^). The extragradient method has received a great deal of attention 
by many authors, who improved it in various ways; see, e.g., [4,7,9,17] and references 
therein. We restrict our attention only to one extension of the extragradient method. 
It was proposed in [4] by Censor, Gibali and Reich 

{ Vn = PciXn - XF{Xn)) 

Tn = {W G H \ {Xn - XF{Xn) -yn,W - yn) < 0} (1.3) 

^n+1 — PTn^Fn. 


where A G (0, j^). Since the second projection in (1.3) can be found in a closed 
form, this method is more applicable when a projection onto the closed convex set 
C is a nontrivial problem. 

As an alternative to the extragradient method or its modifications is the fol¬ 
lowing remarkable scheme proposed by Tseng in [20] 


Pn — Pc{,^n XF(^Xn')') 

^n+1 — Vn F A(7^(Xyj) F ^ 


(1.4) 


where A G (0, ■^). Both algorithms (1.3) and (1.4) have the same complexity per 
iteration: we need to compute one projection onto the set C and two values of F. 

Popov in his work [16] proposed an ingenious method, which is similar to the 
extragradient method, but uses on every iteration only one value of the mapping 
F. Using the idea from [4,12], Malitsky and Semenov improved Popov algorithm. 
They presented in [13] the following algorithm 

{ Tn = {w G H \ {xn - XF{yn-i) -yn,w- yn) < 0} 

Xn+l = PT„{Xn - AF(?/„)) (1.5) 

2/n+l — Pc(^n+1 XF(^yn))^ 

where A G (0, ■^]. It is easy to see that this method needs only one projection onto 
the set C (as in (1.3) or (1.4)) and only one value of F per iteration. The latter 
makes algorithm (1.5) very attractive for cases when a computation of operator F 
is expensive. This often happens, for example, in a huge-scale VIP or VIP that 
arises from optimal control. 

In this work we propose the following scheme 

Xn-\-l — PciXn XF {^Xn l)); (^■^) 

where A G (0, ^~^ ). Again we see that both algorithms (1.5) and (1.6) have 
the same computational complexity per iteration, but the latter has more simple 
and elegant structure. Algorithm (1.6) reminds the projected gradient method, 
however, a value of a gradient is taken in the point that is a reflection of Xn-i in 
Xn- Preliminary results from numerical experiments, comparing algorithm (1.6) to 
others, are promising. Also note that a simple structure of (1.6) allows us to obtain 
a nonstationary variant of (1.6) with variable stepsize in a very convenient way. 

The paper is organized as follows. Section 2 presents some basic useful facts 
which we use throughout the paper. In Section 3 we prove the convergence of the 
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method (1.6). Under some more restrictive assumption we also establish its rate of 
convergence. In Section 4 we present the nonstationary Algorithm 4.2 which is more 
flexible, since it does not use the Lipschitz constant of the mapping F. This makes 
it much more convenient in practical applications than Algorithm 1.6. Section 5 
contains the results of numerical experiments. 

2. Preliminaries. The next three statements are classical. For the proof we 
refer the reader to [2,3]. 

Lemma 2.1. Let M he nonempty closed eonvex set in H, x € H. Then 

(i) {Pmx -x,y- Pmx) >0 Vy € M; 

(ii) \\PMX-y\\^ <\\x-y\\^-\\x- Pmx\\^ 'iy € M . 

Lemma 2.2 (Minty). Assume that F : C ^ H is a continuous and monotone 
mapping. Then x* is a solution of (1.1) ijf x* is a solution of the following problem 

find X G C such that {F{y),y — x) > 0 Vy € C. 

Remark 2.1. The solution set S of variational inequality (1.1) is closed and 
convex. 

As usual, the symbol ^ denotes a weak convergence in F[. 

Lemma 2.3 (Opial). Let (xn) be a sequence in H such that Xn x. Then for 
all y ^ X 


liminf \\xn — a;|| < liminf ||x„ — y\\. 

n—¥oo n—^oo 


The proofs of the next four statements are omitted by their simplicity. 

Lemma 2.4. Let M be nonempty closed convex set in H , x G H and x = Pmx. 
Then for all X > 0 


Pm{x + X[x — x)) = X. 


Lemma 2.5. Let u, v be in H. Then 

||2u-uf =2||uf-||H|V2||u-uf . 

Lemma 2.6. Let (an) and (bn) be two real sequences. Then 
liminf a„ + liminf < liminf(a„ + 6„). 

n— >-oo n—yoo n—yoo 

Lemma 2.7. Let {an), (bn) be two nonnegative real sequences such that 

an+l an bn. 

Then (on) is bounded and lim„_>oo = 0. 

In order to establish the rate of convergence, we need the following 
Lemma 2.8. Let (a„), (6„) be two nonnegative real sequences and ao,P G (0,1) 
such that for all a G (0,ao) the following holds 

On+i + bn+i ^ (1 ~ 2a)a„ + aon-i + Pbn. (2.1) 

Then there exist 7 G (0,1) and M > 0 such that a„ < for any n > 0. 

Proof First, note that a simple calculus ensures that f{x) = jg 

decreasing on (0,1). Since / is continuous and /(O) = 1, we can choose a G (0,ao) 
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such that f{a) > (3. Our next task is to find some 7 G (0,1) and 5 > 0 such that 
( 2 . 1 ) with a defined as above will be equivalent to the inequality 

Ori+l + ^0,n + &n+l < 7(®n + + Phn- 


It is easy to check that such numbers are 

1 - 2 a + VI + 4a2 

7 =-- 


e (0,1), 


, -l + 2 a +Vl + 4a2 ^ 

a =-5->0. 

Then by 7 = /(a) > /3 we can conclude that 

a-n+i + 3an + bn +1 < 7(a„ + (5a„-i + bn). 


Iterating the last inequality simply leads to the desired result 

a„+i + 6 an + bn+i < j(an + San-i + bn) < ■ ■■ < 7"(ai + + 61) = 

where M = ai + Sag + bi > 0. □ 

3. Algorithm and its convergence. We first note that solutions of (1.1) 
coincide with zeros of the following projected residual function: 

r(x,y) := \\y - Pcix - XF{y))\\ + ||a; - ?/||, 


where A is some positive number. Now we formally state our algorithm. 
Algorithm 3.1. 

1. Choose xo = yo € H and X G (0, ^~^ ). 

2. Given the current iterate Xn and yn, compute 

Xn+l = Pc{Xn - XF{yn)) 

3. If r(xn,yn) = 0 then stop: Xn = Pn = Xn+i is a solution. Otherwise 
compute 


Un+l — ‘IXn+1 Xn: 

set n \= n + 1 and return to step 2. 

Next lemma is central to our proof of the convergence theorem. 

Lemma 3.1. Let (xn) and (yn) be two sequences generated by Algorithm 3.1 
and let z £ S. Then 

||a:„+i - z\\'^ < \\Xn - z\\^ - (1 - AL(1 + V2)) \\Xn - Xn-l\f + XL \\Xn - yn-l\\^ 

- (1 - V2XL) \\Xn+l - yn\\^ - 2A {F{z),yn - z) . (3.1) 


Proof. By Lemma 2.1 we have 

||x„+i - z\f< \\Xn - XF{yn) - z\f - \\Xn “ XF{yn) - Xn+l\\^ 

= \\Xn - Z\f - \\Xn+l - Xn\f “ 2A {F{yn),Xn+l “ z) . (3.2) 

Since F is monotone, 2A {Ffpn) — F{z), yn — z) > 0. Thus, adding this item to the 

right side of (3.2), we get 

\\Xn+l - Z\\'^ < \\Xn - Z\\'^ - \\Xn+l “ Xn\\^ + 2A (i^(y„), ?/„ - Xn+l) “ 2A {F{z),yn - z) 

= \\Xn - Z\f - \\Xn+l - Xn\f + 2X{F{yn) - F(?/„_i),2/„ - Xn+l) 

+ 2A {F{yn-i): yn “ Xn+l) “ 2A {F{z),yn - z). (3.3) 
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As Xn+i,Xn-i € C, we have by Lemma 2.1 

{Xn Xn—1 — ^ Xji Xn-\-l} "F: 0 ; 

{Xn - Xn-1 + XF{yn-l), Xn “ Xn-l) < 0. 

Adding these two inequalities yields 

{xn Xji—i J- \Fj yji Xyi-|_i) ^ 0, 

from which we conclude 

2A {F{yn-i), y-n - Xn+i) < 2 (x„ - x„+i - y„) = 2 (y„ - a;„, Xn+i - Vn) 

= \\Xn+l - Xn\f - \\Xn - yn\f “ l^n+l “ 2/n||^ , (3.4) 


since Xn - Xn-l = Vn - Xn- 

We next turn to estimating — F{yn-i),Xn+i — yn)- It follows that 

2X{F{yn) - F{yn-i),yn - Xn+i) < 2AL ||?/„ - y„-i|| ||a;n+i - yn\\ 

< \\y^ ~ ^ \\Xn+l - Vnf) 

< AL-^((2 + V 2 ) \\yn - Xn\f + a/2 \\Xn - Vn-iW^) + V2XL \\Xn+l “ 

= AL(1 + a/2) \\yn - Xn\f + XL ||x„ - yn-lf + a/2AL \\Xn+l - yn\\^)- (3.5) 

Using (3.4) and (3.5), we deduce in (3.3) that 

\\xn+i - zW^ < \\xn - z\f - (1 - AL(1 + V2)) \\xn “ yn\f + XL ||a;„ - yn-l\\^ 

- (1 - \/2XL) \\Xn+l - yn\\^ - 2A {F{z),yn - z ). 

which completes the proof. □ 

Now we can state and prove our main convergence result. 

Theorem 3.2. Assume that (C1)-(C3) hold. Then any sequence (Xn) gener¬ 
ated by Algorithm 3.1 weakly eonverges to a solution o/(l.l). 

Proof. Let us show that the sequence (xn) is bounded. Fix any z € S. Since 
1 — \/2XL > XL and 

{F[z),yn - z) =2 {F{z), Xn - z) - {F{z), Xn-l - z) 

> {F{z), Xn- z) - (T(z), Xn-l “ z) , 

we can deduce from inequality (3.1) that 

\\xn-ii - z\\^ + XL ||a;„+i - + 2A {F{z), Xn - z) 

< \\xn - 2;||^ + AL \\Xn - yn-l\f + 2A (T(z), Xn-l - z) (3.6) 
- (1 - AL(1 + -^2)) \\Xn - Xn-l\f . 

For n > 2 let 

On = \\Xn - Zll^ + AL \\Xn “ J/n-l ||^ + 2A {F{z), Xn-l “ z) , 
bn = {I - AL(1 + a/2)) \\Xn - Xn-l\f ■ 

Then we can rewrite inequality (3.6) as an+i ^ an — bn- By Lemma 2.7, we con¬ 
clude that (a„) is bounded and lim„^oo \\xn — = 0. Therefore, the sequences 

(liccra — 2 :||) and hence (xn) are also bounded. From the inequality 

ll^n-t-l UnW ‘Li ll^n-t-1 Xn\\ “t“ \\Xn VriW — H^^n+l Xn\\ F \\Xn 1 || 
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we also have that lini„^oo lla^n+i — VnW = 0. 

As {Xn) is bounded, there exist a subsequence of (a;„) such that 

converges weakly to some x* G H. It is clear that (j/n,) is also convergent to that 
X*. We show X* € S. From Lemma 2.1 it follows that 

^Xrii + l ^rii {yrii) 1 U ^n^ + l) ^ 0 \fy G C. 

From this we conclude that, for all j/ S C, 

0 < {xm+i - Xni,y - Xm+i) + A {F{yn,),y- J/n^) + A {F{yn,),yni - Xm+i) 

< (x„,+i - Xni,y- Xm+i) + X{F{y),y - y^) + A - x„,+i). (3.7) 

In the last inequality we used condition (C2). Taking the limit as i —>■ oo in (3.7) 
and using that limi^oo Wxm+i - x^W = hmi_>oo WUrn+i - VmW = 0, we obtain 

0 < {F{y),y - X*) VyGC, 

which implies by Lemma 2.2 that x* G S. 

Let us show ^ x*. From (3.6) it follows that the sequence (a„) is monotone 
for any z G S. Taking into account its boundedness we deduce that it is convergent. 
At last, the sequence (\\xn — yn-i\\^) is also convergent, therefore, (||xn — + 

2\{F{z),Xn-i — z)) is convergent. 

We want to prove that (xn) is weakly convergent. On the contrary, assume that 
the sequence (x„) has at least two weak cluster points x G S and x G S such that 
X ^ X. Let (Xn^) be a sequence such that Xn^. x as k ^ oo. Then by Lemma 2.6 
and 2.3 we have 

lim (||a;„ - x\f + 2A {F{x), Xn - x)) 

n—foo 

= ,lim (- xf + 2A {F{x), - x)) = lim ||x„, - xf 

K—^OO K—^OO 

= liminf - x\f < liminf \\xnk - x\\^ 

k—¥QO k—¥oo 

< liminf \\xnk — x\\^ + 2A lim inf {F{x), Xn^. — x) 

k—¥oo k—¥oo 

< liminf (||a;„;, - i||^ + 2A {F{x),Xnk “ x)) 

fc—>00 

= lim (l|x„ - i||^ + 2A(F(5),x„ - i)). 

n^oo 

We can now proceed analogously to the proof that 

lim (||a;„ - i||^ + 2A (F{x),Xn - x)) < lim (||a:„ - + 2A {F{x),Xn - x)), 

which is impossible. Hence we can conclude that Xn weakly converges to some 
x* G S'. □ 

It is well-known (see [19]) that under some suitable conditions the extragradient 
method has R-linear rate of convergence. In the following theorem we show that our 
method has the same rate of convergence under a strongly monotonicity assumption 
of the mapping F, i.e., 

(C2*) {F{x) - F(y),x-y) >m\\x-y\\'^ Vx, y G 77 

for some m > 0. 

Theorem 3.3. Assume that (Cl), (C2*), (C3) hold. Then any sequence (x„) 
generated by Algorithm 3.1 converges to the solution of (1.1) at least R-linearly. 

Proof. Since F is strongly monotone, (1.1) has a unique solution, which we 
denote by 2 . Note that by Lemma 2.5 

\\yn - z\f = 2 ||a;„ - 2 ;||^-||a;n_i - 2:||^-|-2 ||a;„ - x„-i||^ > 2 ||x„ - 2 ;||^-||xn_i - z||^ 
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From this and from (C2*) we conclude that, for all mi C (0,m], 

2X{{F{yn) - F{z),yn - z) - mi (2 \\xn - zW^ - ||x„_i - z\f)) 

> 2X{{F{y„) - F{z), yn- z) -m ||y„ - z\\^) > 0. (3.8) 

From now on, let mi be any number in (0,m]. Then adding the left part of (3.8) 
to the right side of (3.2), we get 

ll^n+l .^11 ^ W^n .2^11 ll^n +1 ^n|| 2A (F'(yyi), y^i X^j^i) 

- 2A {F{z), yn- z) - 2Ami(2 \\xn - z\f - \\xn-i - z\f) 

= (1 - 4Ami) \\Xn - z\\^ - \\Xn+l - Xn\\^ + 2Ami \\Xn-l - z\\^ 

+ 2A {F(^yn) Fijjn-l)-, yn J'n+l) T 2A {^F{ljn—l)t Vn ^n+l) 

- 2A {F{z),yn - z). 

For items {F{yn-i),y„ - Xn+i) and (F(y„) - F(?/„_i),?/„ - Xn+i) we use estima¬ 
tions (3.4) and (3.5) from Lemma 3.1. Therefore, we obtain 

\\Xn+l - z\f < (1 - 4Ami) \\Xn - z\f + 2Ami \\Xn-l - z\f 

- (1 - AL(1 + V2)) \\xn - Xn-i\\^ - (1 - V^XL) I|a ;„+1 - yn\f 

+ AT \\Xn - 2/n-ill^ - 2A (F(z), yn- z) . 

From the last inequality it follows that 

\\xn+i - z\\^ +{l-\/2XL) ||a;„+i - ?/„||^-|-4A {F{z),Xn - z) < (l-4Ami) \\xn - z\\^ 
+ 2Ami \\xn-i - z\f + XL \\xn - yn-i\f + 2A (i^(z),a;„_i - z) 

< (1 - 4Ami) ||a;„ - z\\‘^ + 2Ami ||a;n_i - z\\'^ 

+ max{ ^ “ V2XL) ||x„ - yn-if + 4,X{F(z),Xn-i - z)). (3.9) 

In the first inequality we used that {F{z), yn — z) = 2 {F{z),Xn — z) — {F{z), Xn-i — z 
and in the second we used that 

. r AT 1) 

2 < 4max<-=—, - I. 

ti-v^AT 2 / 

Set 

an = lla^n - z\f , 

= (1 - a/2AT) \\xn - Vn-iW^ -l-4A(T(z),x„_i - z ), 

.A f 11 

I 1 -V 2 AT 2 /’ 
a = 2Ami 
ao = 2Xm. 

As mi is arbitrary in (0,m), we can rewrite (3.9) in the new notation as 

an+i + bn+i ^ (1 ~ ‘2a)an + aan-i -f Va C (0, ao]- 

Since P < 1, we can conclude by Lemma 2.1 that a„ < 7 "M for some 7 C (0,1) 
and M > 0. This means that (x„) converges to z at least R-linearly. □ 
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4. Modified Algorithm. The main shortcoming of all algorithms mentioned 
in §1 is a requirement to know the Lipschitz constant or at least to know some 
estimation of it. Usually it is difficult to estimate the Lipschitz constant more or 
less precisely, thus stepsizes will be quite tiny and, of course, this is not practical. 
For this reason, algorithms with constant stepsize are not applicable in most cases of 
interest. The usual approaches to overcome this difficulty consist in some prediction 
of a stepsize with its further correction (see [9,14,20]) or in a usage of Armijo- 
type linesearch procedure along a feasible direction (see [7,17]). Usually the latter 
approach is more effective, since very often the former approach requires too many 
projections onto the feasible set per iteration. 

Nevertheless, our modified method uses the prediction-correction strategy. How¬ 
ever, in contrast to algorithms in [9,14,20], we need at most two projections per 
iteration. This is explained by the fact that for direction in Algorithm 3.1 we use 
a very simple and cheap formula: ?/„ = 2xn — Xn-i- Although we can not explain 
this theoretically, but numerical experiments show that cases with two projections 
per iteration are quite rare, so usually we have only one projection per iteration 
that is a drastic contrast to other existing methods. 

Looking on the proof of Theorem 3.2, we can conclude that inequality (3.5) is 
the only place where we use Lipschitz constant L. Therefore, choosing A„ such that 
the inequality A„ \\F{yn) — F{yn-i)\\ < a \\yn — yn-i\\ holds for some fixed a < 1 
we can obtain the similar estimation as in (3.5). All this leads us to the following 
Algorithm 4.1. 

{ Choose xq = yo & F[, \q > 0,a G (0, ^/2 — 1). 

Choose A„ s.t. A„ llF(?/„) - U(?/„_i)l| < a l|i/„ - ,n > 0. 

^n+l — 7^c(^n ^nF {yri)^ ^ 

Un+l — ‘^Xn+l 

Although numerical results showed us effectiveness of Algorithm 4.1, we can 
not prove its convergence for all cases. Nevertheless, we want to notice that we did 
not find any example where Algorithm 4.1 did not work. Thus, even Algorithm 4.1 
seems to be very reliable for many problems. 

Now our task is to modify Algorithm 4.1 in a such way that we will be able to 
prove convergence of the obtained algorithm. For this we need to distinguish the 
good cases, where Algorithm 4.1 works well, from the bad ones, where it possibly 
does not. 

From now on we adopt the convention that 0/0 = -boo. Clearly, it follows that 
1/0 = -boo as well. The following algorithm gets round the difficulty of bad cases 
of Algorithm 4.1. 

Algorithm 4.2. 

1 . Choose xq € C, A_i > 0, tq = 1, a € (0, — 1) and some large A > 0. 


Compute 


yo = Pc{xo - >^-iF{xo)) 



xi = Pc{xo - XoF{yo)). 


2. Given Xn and Xn-i, set = 1 and let X{y,T) be defined by 



(4.1) 
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Compute 


Un — 2 x ^1 1, 

•^n — ^i^yriT^ri) j 
Xn+i = Pc{Xn - XnF{yn)). 

3. If r{xn,yn) = 0 then stop: x„ = j/„ = x„+i is a solution. Otherwise 
compute 

tn= - \\Xn+l - Xn\\^ + 2A„ {F{yn), J/n “ Xn+l) + (1 “ «(! + V^)) \\Xn “ 2/n||^ 

- a \\xn - Vn-iW^ + (1 - V^a) ||a;«+i - ?/«||^ ■ 

4. Iftn<0 then set n := n + 1 and go to step 2. Otherwise we have two cases 
An ^ An—1 and An ^ An—l- 

(i) //An > An -1 then choose A^ G [An-i,An] such that 

WK^iVn) - Xu-iF{yn-i)\\ < a \\yn - Vu-iW ■ (4.2) 


Compute 


Xn+l — Pc^Xn Xj^Fi^y^if). 


Set An := A(j, n := n + 1 and go to step 2. 

(ii) If Xn < An-i then find € (0,1] such that 

y'^=Xn+ T^iXn - Xn-l) 

Xiy'n^T^) > r^An-l. 

Then choose A(j G [r/An-i, A(y(j,r/)] such that 

WK^iVn) - <A„-iF( 2 /n-i)|| < a \\y'^ - yn-i\\ ■ 


(4.3) 


(4.4) 


Compute 

Xn+l = Pc{Xn - X'^F{y'J) 

Set An := A(j, Tn := r/, := y!^, n := n + 1 and go to step 2. 

It is clear that on every iteration in Algorithm 4.2 we need to use a residual func¬ 
tion with different A, namely, r(x„, i/n) = WVn - Pcixn - An/’(yn))|| + \\xn - VnW- 
First, let us show that Algorithm 4.2 is correct, i.e., it is always possible to 
choose A(j and on steps (4.i) and (4.ii). For this we need two simple lemmas. 
Lemma 4.1. Step (f.i) in Algorithm f.2 is well-defined. 

Proof. From the inequality 


\\Xn-lF{yn) - A„-iF(l/n-l)|| < A„ \\F{yn) - F{yn-l)\\ < a \\yn - 2/n-l|| 


we can see that it is sufficient to take X'„ = A„-i. (However, it seems better for 
practical reasons to choose X'^ as great as possible). □ 

Lemma 4.2. Step (f.ii) in Algorithm f.2 is well-defined. 

Proof. First, let us show that A(?/,r) > ajL for all y,yn € H and r G (0,1]. It 
is clear that for every u,v € H 

a ||m — n|| a ||u — i;|| a 
||/^(m) -/^(?;)|1 - L||m- i;|| L' 

Then Aq > a/L and hence by induction 


A(y, r) := min| 


a\\y-yn-i\\ 

||F( 2 /)-F( 2 /„-i)|| 


1 + Tn-l 

An -1 





T 
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Therefore, it is sufficient to take = --- € (0,1]. (But, as above, it seems 

better to choose as great as possible). 

At last, we can prove the existence of X'^ G [r^Xn-i, X(t/^, such that (4.4) 
will hold by the same arguments as in Lemma 4.1. □ 

The following lemma yields an analogous inequality to (3.1). 

Lemma 4.3. Let (x„) and (yn) be two sequences generated by Algorithm ^.2 
and let z G S, a G (0,\/2 — 1). Then 

||a;„+i - z||^ < \\xn - z\\^ - (1 - a{l + a/ 2 )) ||a;„ - yn\f - (1 - V^a) ||x„+i - yn\f 
+ a \\xn - yn-i\f - 2A„ {F{z), yn- z). (4.5) 


Proof. Proceeding analogously as in (3.2) and in (3.3), we get 

ll^n+l 2^11 ^ ll^n .2^11 ll^n+1 Xn\\ T 2An {F{yn) F ijjn—l^ ^ yn Xn+l) 

+ 2A„ {F{yn-i), yn - Xn+i) - 2A„ {F{z), yn - z). (4.6) 

The same arguments as in (3.4) yield 

2An—1 {F (jjn—l') ^ yn ^n+l) ^ 2 (Xn Xn — lj Xn+1 ^/n) — 2 (yn Xn, Xn+1 yn) 

= ||x„+i - x„||^ - ||x„ - yn\f - \\xn+i - yn\f ■ (4.7) 
Using (3.5) and the inequality A„ ||U(?/„) - F(j/„_i)|| < a \\yn - we obtain 

2 An (^F(jjn^ F (^n —1) ; yn 3)^71+1) ^ 2o; |[ ^/n Vn—l || ||^n+l yn || 

< a(l + \/2) |ly„ - x„||^ + a ||x„ - yn-i\\^ + V^a ||x„+i - y„||^ . (4.8) 

Applying (4.7) and (4.8) in (4.6), we conclude that 

||x„+i - z\\^ < \\Xn - z\\^ - ||x„+i - X„||^ 

+ a((l + a/2) ||x„ - yn\f + ||x„ - yn-i\f + V2 ||xn+i - yn\f) 

+ T^(ll®«.+ 1 “ ^n\f - ||a;„ - yn\f - ||a;n+l - yn\f) 

An-1 

-2An(F(z),yn-z). (4.9) 

Now we have a lot of cases 

(a) An — An— 1 ; 

(b) An An—1 and ||3^n+l ^n|| ^ ll^n yn\\ ||^n+l I/nll ; 

(c) An ^ An—1 and ||^n+l ^n\\ ^ W^n yn\\ ||^n+l yn\\ ; 

(d) An An—1 and ||xn+l ^n\\ ^ W^n yn\\ ||^n+l yn\\ ; 

(e) An ^ An —1 and ||^n+l ^n|| ll^n yn\\ ||^n+l yn\\ ■ 

Let US consider all them. 

Case (a). It is quite obvious that when An = An-i we obtain inequality (4.5). 
Cases (b) and (e). In both cases we notice that 

- T^-) ||x„+i - Xnf > (1 - T^-) (||x„ - y 

An—1 An—1 


(1 


'nil + ll^^n+l J/n II ) ■ 
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Thus, from (4.9) we conclude that 

Ikn+l - z\\^ < \\Xn - z\\^ - (1 - T^) l^n+l “ Xn\\^ 

^n-1 

+ a((l + V2) \\Xn - Unlf + \\Xn “ 2/n-l||^ + V2 \\Xn+l “ UnW^) 

- T^dkn - yn\f + \\Xn+l “ VnW^) “ 2A„ {F{z),yn “ z) 

An-1 

< \\Xn - Z\\^ - \\Xn - Vnlf “ ||a;n+l “ Vnlf “ 2A„ {F{z), yn “ z) 

+ a((l + v^) \\Xn - yn\\^ + \\Xn “ 2/n-l||^ + V2 \\Xn+l “ 2/n||^) 

= \\Xn - Z\f - (1 - a(l + V^)) \\Xn “ 2/ri||^ “ (1 “ V^a) \\Xn+l “ l/n||^ 
+ a. \\Xn - yn-i\t - 2A„ {F{z),yn - z) , 

which is exactly inequality (4.5). 

Observe that inequality (4.9) with condition < 0 is equivalent to (4.5). So, 
from now on we assume that > 0. 

Case (c) and > 0. It is clear that we can rewrite (4.6) (with X'^ instead of 
A„) as 

ll^n+l -^ll ^ \\Xn -^11 ll^n+1 11 T 2 {X^Fi^y^^ Ayj_i ), y^ Xyi_|_i) 

+ 2An_i {F(^yn—l), yn Xn+l) 2A„ {F(^z), yn z) 

< \\Xn - z\f - ||a;„+l - Xn\\^ + 2a \\yn - yn-l\\ hn - a^n+lll 

+ 2An_i {F(^yn—i), yn Xn+i) 2A„ {F(^z), yn z). 

Now using (4.7) and the second inequality in (4.8), we get easily (4.5). 

Case (d) and > 0. Let A(j_]^ = r,!(A„_i and = x„ —t,((x„— x„_i). Since 
X'^_i < A„-i, the point x'^_i lies on the segment [Xn-i^Xn] and the point x'j^_i — 
X[^_iF(yn-i) lies on the segment [x^Xn-i — Xn-iF{yn-i)]. Thus by Lemma 2.4 

Xn = Pc{x'^-i - A(,_iF(?/„_i)). 

Also we have that Xn+i = Pc{xn — ^nPiVn)) v'n = 2x„ — x'^_i. Applying the 
same arguments as above for this new Xn+i, we do not get into the Case (d), since 
A(i > A(j_i, and do not get into the Case (c), since 

WKPiy’n) - K-lPiyn-l)\\ < a \\y’„ - yn-l\\ ■ 


This means that we get 

\\Xn+l - zf < \\Xn - zf - (1 - a(l + V^)) \\Xn “ y'n\t “ (1 “ ^a) l^n+l “ y'nf 
+ a \\xn - yn-if - 2A(, (F(z), y'n- z). 

Hence, for redefined values we obtain the desired inequality (4.5) and the proof is 
complete. □ 

Before proceeding, we want to give some intuition to Algorithm 4.2. First, note 
that A is the upper bound of (A„). It guarantees that our stepsize does not go to 
infinity. The item Xn-i in (4.1) provides us necessary condition to obtain 

the good estimation of 2A„ {F{z),yn — z). This seems not very important, since 
in many cases {F{z),yn — z) > Q and we can just not take into account this item 
in (4.5). But this condition also ensures that A„ is not too large in comparison with 
An_i. This may be usefnl if we get into the Case (c) or (d) in Lemma 4.3, since 
we will be able to perform (4.2) or (4.4) faster. Also note that usnally we have 
Xn < 2A„_i, since t„ I only in the Case (d). 
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The inequality < 0 ensures at once that on the n-th iteration inequality (4.5) 
holds. Therefore, we do not need to immerse into a branch of the cases in Algo¬ 
rithm 4.2 or Lemma 4.3. Also note that a former and a latter use different classi¬ 
fications of possible cases. In Lemma 4.3 the inequality < 0 holds certainly for 
cases (a), (b), and (e). But it happens very often that even in such bad cases as (c) 
or (d) tn < 0 also. Thus, by checking whether < 0, we can avoid to get into the 
undesirable steps (4.i) or (4.ii). 

If > 0 and An > A„_i, then we get into step (4.i). Actually, this case is 
not so bad. We have two possibilities here. The first one is to decrease A„ until it 
satisfies (4.2). And the second one is just to set A(j = A„_i. The first one seems 
to be better, since we suppose, as usual, that a larger stepsize provides a smaller 
number of iterations (however, it needs some extra work to check condition (4.2)). 

It only remains the most tedious case when > 0 and A„ < A„_i. In this 
case we show that it is always possible to change ?/„ = 2xn — Xn-i to some other 
Un, defined by j/n = (1 + Tn)xn — TnXn-i with some Tn G (0,1), which we choose to 
ensure that (4.3) and (4.4) hold. Next, with new j/n and A„ we compute Xn+i as in 
step 2. The conditions (4.3) and (4.4) guarantee that for new tn we have < 0. 
However, when we go to the next iteration, we must not forget that yn and A„ have 
been changed. 

Now we prove the convergence of Algorithm 4.2 in a quite similar way as in § 3. 

Theorem 4.4. Assume that (C1)-(C3) hold. Then any sequences (Xn) and 
iUn) generated by Algorithm 4^.2 weakly converge to a solution of (LI)- 

Proof. Let us show that the sequence (Xn) is bounded. Fix any z G S. Since 
AiAn < (1 + 'rn_i)A„_i, we have that 

An {F{z), yn - z) = An {F{z), (1 -|- T„)a;„ - TnXn-l “ z) 

= A„(l + Tn) {F{z), Xn- z) - AnT„ {F(z), X„_i - z) 

> An(I -b Tn) {F{z),Xn - z) - A„_i(l -f T„_i) {F{z),Xn-l “ z) . 

And hence we can deduce from inequality (4.5) that 

\\Xn+l - zW"^ + a \\Xn+l - yn\t + 2A„(1 -|- Tn) {F{z),Xn “ z) 

< \\Xn - z\f + a \\Xn - yn-l\f + 2A„_l(I -|- Tn-l) {F{z),Xn-l - z) 

- (1 - Q;(I -b V2)){\\Xn - yn\f + Ikn-Sl “ Vnlf)- (4.10) 

For n > 1 set 

a„+i = ll^n+l - z\f + a \\Xn+l - yn\f + 2An(l + T„) {F{z), Xn - z) , 

6 n = (1 - a(l + V2))i\\Xn - yn\f + \\Xn+l “ yn||^)- 

With this notation we can rewrite inequality (4.10) as On+i < On — bn. Thus by 
Lemma 2.7 we can conclude that (a„), and hence {xn), are bounded and lim„_>oo bn = 
0. From the latter it follows immediately that 

lim ||xn+i - 2 /n|| = 0 and lim ||x„+i - Xn|| = 0. 

n—¥oc n—¥oo 

From this point we need to proceed in much the same way as in Theorem 3.2. 
Thus, to avoid the repetition, we restrict our attention to only the place where 
arguments differ. This is inequality (3.7) where we want to go to the limit as 
*—>■ 00 . Recall that A and ^ are the upper and the lower bounds of the sequence 
(A„), respectively (the former follows form the definition of A„ and the latter from 
Lemma 4.2). Consequently, we can conclude that the sequence (A„) has a cluster 
point, and moreover, all its cluster points are not equal to zero. This means that 
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when we go to the limit in (3.7) we obtain the same result as in Theorem 3.2. This 
is the desired conclusion. □ 

Algorithm 4.2 allows several modifications. In particular, instead of checking 
whether tn 0, one can require at once that A„, which we choose in step 2, satisfies 
the following inequality 


\\KF{yn) - A„_iF(y„_i)|| < a ||y„ - y„_i|| . 


The arguments, used in steps (4.i) and (4.ii), show that is always possible to do 
(perhaps, with changed y„). Although this algorithm have only one projection per 
iteration, our computational experience shows that it is less effective than Algo¬ 
rithm 4.2. This is explained again by the fact that very often < 0 even for not 
suitable A„. 

5. Numerical illustration. In this section we examine the benchmark of 
Alg. 3.1 and 4.2. For this, first, we present the performance of stationary algorithms 
on some affine problems. And then we compare nonstationary Alg. 4.2 with two 
well-known methods. For abbreviation we use the following. 

Stationary methods: 

• EGM for algorithm described by (1.2); 

• SubEGM for algorithm described by (1.3); 

• TBFM for algorithm described by (1.4); 

• SubPM for algorithm described by (1.5). 

Nonstationary methods: 

• Alg. Solod.-Svait. for the algorithm of Solodov and Svaiter [17]; 

• TBFM-nst for the nonstationary backward-forward algorithm of Tseng [20]. 

Note that in [17] there were considered two algorithms, so we chose the second 

one with the better performance. 

For a benchmark of all algorithms we included the number of iterations (iter.) 
and the time of execution (time). For the nonstationary algorithms we also added 
the total number (np) of projections and the total number (nf) of all values F that 
were evaluated. 

We describe the test details below. Computations were performed using Wol¬ 
fram Mathematica 8.0 on an Intel Core i3-2348M 2.3GHz running 64-bit Linux 
Mint 13. The time is measured in seconds using the intrinsic Mathematica function 
Timing. The projection onto the feasible set C was performed using the intrinsic 
Mathematica function ArgMin (for Problems 1, 4, and 5 the projection is explicit). 
The termination criteria are the following 


EGM, SubEGM, TBFM, Solod-Svait., TBFM-nst. \\xn - VnW < e, (5.1) 


r{xn+i,yn) < e, (5.2) 
r{Xn,yn)<£- (5.3) 


SubPM 
Alg. 3.1, 4.2 


We do not use the simple criterion jja; — Pc{x — AF(a;))|] < e for Alg. 3.1, 4.2, and 
SubPM because we do not want to compute Pc{x — XF{x)) extra. The better way 
is to verify that 

Wun Pcivn ^F{yn))\\ ^ Ijj^n-l-l 2/n || + Ijs^n-l-l Pcijjn ^F{yn))\\ 

— ll^n-t-1 VriW T W^n VnW — 1/n)- 

The last inequality holds, since Pc is nonexpansive. The similar argument works 
for SubPM. Moreover, we notice that termination criteria (5.2), (5.3) are stronger 
than (5.1). This means that the solutions generated by Alg. 3.1, 4.2, and SubPM 
are not worse (or even better) than the solutions generated by other algorithms if 
we measure the quality of the solution by the projected error bound. 
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m 

EGM 

SubEGM 

SubPM 

Alg. 3.1 


iter. 

time 

iter. 

time 

iter. 

time 

iter. 

time 

500 

129 

1.3 

129 

1.3 

109 

0.6 

92 

0.5 

1000 

133 

5.6 

133 

5.7 

120 

2.6 

95 

2 

2000 

138 

22 

138 

22 

121 

10 

98 

8 

4000 

143 

98 

143 

101 

122 

43 

101 

33 


Table 5.1 

Results for EGM, SubEGM, SubPM, and Alg 3.1 for Problem 1. L = 1, X = 0.4, e = 10“^ 


m 

EGM 

SubEGM 

TBFM 

SubPM 

Alg. 3.1 


iter. 

time 

iter. 

time 

iter. 

time 

iter. 

time 

iter. 

time 

10 

82 

7.7 

82 

3.9 

82 

3.9 

83 

4.0 

83 

4.0 

10 

146 

15.2 

148 

7.8 

148 

7.8 

152 

8.0 

147 

7.8 

10 

63 

6.3 

63 

3.2 

63 

3.2 

63 

3.2 

63 

3.2 

20 

78 

10.2 

79 

5.2 

79 

5.3 

79 

5.3 

79 

5.3 

20 

101 

14.5 

102 

7.3 

102 

7.4 

101 

7.3 

102 

7.4 

20 

no 

17.6 

no 

8.9 

no 

8.9 

no 

8.9 

111 

9.0 

50 

224 

243 

226 

126 

226 

128 

231 

134 

225 

134 

50 

236 

277 

237 

144 

238 

143 

241 

151 

237 

144 

50 

444 

507 

447 

265 

447 

266 

455 

263 

445 

267 


Table 5.2 

Results for EGM, SubEGM, TBFM, SubPM, and Alg 3.1 for Problem 2. L = ||M||, 
A = 0.4/L, e = 10-3 


Problem 1. Our first example is classical. The feasible set is C = R™ and F{x) = 
Ax, where A is a square matrix m x m defined by condition 


- 1 , 

1 , 

0 , 


j=m + l — z>i 
j = m + l — i<i 
otherwise. 


This corresponds to a matrix whose secondary diagonal consists half of — Is and half 
of Is, the rest of the elements being zero. For even m the zero vector is the problem 
solution of (1.1). This is a classical example of a problem where usual gradient 
method does not converge. The results of the numerical experiments are compiled 
in Table 5.1. We did not include the results of TBFM, since for this problem it is 
identical with EGM. For all tests we took = (1,..., 1), e = 10“^ and A = 0.4. 
Problem 2. We take F{x) = Mx + q with the matrix M randomly generated as 
suggested in [6]: 

M = AA^ + B + D, 


where every entry of the mxm matrix A and of the mxm skew-symmetric matrix B 
is uniformly generated from (—5,5), and every diagonal entry of the mxm diagonal 
D is uniformly generated from (0, 0.3) (so M is positive definite), with every entry 
of q uniformly generated from (—500, 0). The feasible set is 

C = {x G R+ I xi -I- a :2 H- Xm = m}, 

and the starting point is xq = (!,...,!). For this problem we took L = \\M\\. 
For every m, as shown in Table 5.2, we have generated three random samples with 
different choice of M and q. The error is e = 10“^. 

As one can see from Table 5.2, EGM has the worst performance. This is explained 
by the presence of two projections per iteration, which played the key role, since 
for small m the computation of F is cheap. All others algorithms behave similarly. 
Also it is very likely that with an increasing of m they will continue to have ap¬ 
proximately the same number of iterations. However, for large m the computation 


























































PROJECTED REFLECTED GRADIENT METHODS 


15 



£ 

Alg. 4.2 

Alg. Solod.-Svait. 

TBFM-nst 



iter, (np/nf) 

time 

iter, (np/nf) 

time 

iter, (np/nf) 

time 

(1,1,1,1) 

10-'^ 

36 (36/36) 

1.0 

6 (11/11) 

0.4 

62 (66/128) 

2.1 


lO^® 

72 (82/86) 

2.2 

6 (11/11) 

0.4 

156 (160/316) 

4.9 

(0.5, 0.5, 2,1) 

10"^ 

41 (41/41) 

1.1 

18 (35/58) 

1.4 

69 (73/142) 

2.2 


10-*' 

75 (87/86) 

2.4 

— 

> 50 

163 (167/330) 

5.0 

random 

10-** 

57 (57/58) 

1.6 

13 (25/49) 

1.2 

72 (77/149) 

2.3 


10-*' 

63 (67/65) 

1.9 

— 

> 50 

79 (84/163) 

2.6 

random 

lO-*' 

40 (40/40) 

1.0 

20 (39/73) 

1.5 

66 (70/136) 

2.0 


lO-** 

71 (82/74) 

2.0 

— 

> 50 

160 (164/324) 

4.9 

random 

10-** 

59 (59/59) 

1.5 

15 (29/40) 

1.1 

87 (91/178) 

2.6 


lO”*’ 

93 (99/105) 

2.5 

— 

> 50 

181 (185/366) 

5.7 


Table 5.3 

Results for Alg. f.2, Alg. Solod.-Svait., and TBFM-nst for Problem 3 


of F will also matter, so for those cases we can expect that SubPM and Alg. 3.1 
will have substantially better performance, compared with the others. 

From now on, we present out test results for nonstationary methods. The choice of 
test problems for nonstationary algorithms is explained by a simple structure of the 
feasible set C. This is important for the better performance of TBFM-nst, since 
one must compute many projections per iteration, and Alg. Solod.-Svait., since on 
every iteration one must compute a projection onto the intersection of C with some 
hyperspace. (Of course, instead of a projection onto this intersection, we can project 
onto the set C. But in that case the stepsize will be smaller). 

In the implementation of Alg. Solod-Svait. we chose cr = 0.3, 7 = 0.5, 0 = 4; in 
the implementation of TBFM-nst. we chose [3 = 0.5, 0 = 0.9 where the notation is 
taken from [17] and [20] respectively. For Alg. 4.2 we used a = 0.4 and A_i = 0.01. 
Problem 3. Kojima-Shindo Nonlinear Complementarity Problem (NCP) was con¬ 
sidered in [15] and in [ 8 ], where m = 4 and the mapping F is defined by 


F{xi,X2,x:i,X4) 


3x1 + 2xiX2 + 2 x 2 -b a;3 -I- 3a;4 — 6 
2xi + xi + X 2 + 10x3 + 2x4 — 2 

3xf -b X 1 X 2 + 2x2 + 2x3 + 9x4 — 9 
xf -b 3x| -b 2x3 + 32:4 — 3 


The feasible set is C = {x € ] xi -b X 2 -b X 3 -b X 4 = 4}. First, we chose some 

particular starting points: x° = (1,1,1,1) and x° = (0.5,0.5, 2 , 1 ). And then we 
used different random points from C. For all starting points we had two tests: with 
e = 10“^ and e = 10“®. The results are summarized in Table 5.3. Here we aborted 
the evaluation of algorithm after 50 seconds. 

As one can see, the ratio nf /iter, for Alg. Solod.-Svait. is much higher than for 
Alg. 4.2. For this reason even with larger number of iterations Alg. 4.2 some¬ 
times outperforms Alg. Solod.-Svait. Note that TBFM-nst has the worst perfor¬ 
mance, since on every iteration it must compute many nontrivial projections onto 
the simplex (this is exactly what we meant at the beginning of §4). The solution 
of this problem is not unique, but most of the times all algorithms converge to the 
X* = (1.225, 0,0, 2.775). But for example for x° = (1,1,1,1) algorithms converge to 
the different solutions, this is the reason of such distinction in their performance. 
Also for e = 10“® Alg. Solod.-Svait. often converges very slowly, that is why it 
does not execute in 50 seconds. In that time, Alg. 4.2 and TBFM-nst have not this 
drawback, since their stepsizes A„ are separated from zero. 
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e 

Alg. 4.2 

Alg. Solod.-Svait. 

TBFM-nst 



iter, (np/nf) 

time 

iter, (np/nf) 

time 

iter, (np/nf) 

time 

5 

10 -'" 

20 (20/20) 

0.008 

18 (35/70) 

0.012 

22 (24/46) 

0.007 


10 -'" 

43 (43/43) 

0.014 

34 (67/134) 

0.018 

47 (49/96) 

0.019 

50 

10 -" 

23 (24/26) 

0.02 

20 (39/78) 

0.06 

28 (30/58) 

0.04 


10 -'" 

46 (47/49) 

0.04 

34 (67/134) 

0.1 

53 (55/108) 

0.07 

500 

10 -'" 

27 (28/30) 

0.6 

22 (43/86) 

1.6 

32 (34/66) 

1.0 


10 -'" 

50 (51/53) 

1.4 

36 (71/142) 

2.7 

58 (60/118) 

1.8 

1000 

lO-'" 

28 (29/31) 

1.4 

22 (43/86) 

4.8 

33 (35/68) 

2.8 


10 -'" 

51 (52/54) 

2.6 

36 (71/142) 

7.4 

59 (61/120) 

5.1 


Table 5.4 

Results for Alg. 4-2 o-'iT'd Alg. Solod.-Svait., and TBFM-nst for Problem 4 


Problem 4. This example was considered by Sun in [18], where 
F{x) = Fi{x) + F 2 {x), 

F 2 {x) = Dx + c, 

f^{x) = xl_i +Xi + Xi-iXi + XiXi+i, I = 1, 2,..., m 

Xq — Xjn+1 — 

Here H is a square matrix m x m defined by condition 

f 4, i = j, 


[O, otherwise, 

and c = (—1, —1,..., —1). The feasible set is C = and the starting point is 
= (0,0,..., 0). The test results are listed in Table 5.4 for different m and e. 
Problem 5. This problem was used by Kanzow in [8] with m = 5. The mapping F 
is defined by 


Fix) = (/i(x),...,/5(x)), 

5 

fi{x) = 2{xi - i + 2) exp|^(a:i - i + 2)^|, i = 1,..., 5 

i=l 

and the feasible set is C = The solution for this problem is x* = (—1, 0,1, 2, 3). 
The results of the numerical experiments for different starting points and e are 
compiled in Table 5.5. 

We want to note that for this problem only the number of iterations matter, 
since both projection and mapping F are cheap. For this reason, the time is quite 
small and hence, can not be a good illustration of the algorithm’s performance, 
since Mathematica function Timing measures time only with some error of order 
10-3. 

To summarize our numerical experiments, we want to make some observations. 
First, the disadvantage of Alg. 3.1 in comparison with other stationary algorithms is 
a smaller interval for possible stepsize A. For simple problems it really can matter, 
however, for huge-scale problems evaluation of F is much more important. To 
illustrate this, let us consider Problem 1 again. The best performance of EGM and 
SubEGM achieves approximately for A = 0.7 (SubPM and Alg. 3.1 do not converge 
for such A). Nevertheless, the time execution of EGM and SubEGM for m = 2000 
is 12 seconds for both, and for m = 4000 is 52 and 53 seconds, respectively. Hence, 
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£ 

Alg. 4.2 

Alg. Solod.-Svait. 

TBFM-nst 



iter, (np/nf) 

time 

iter, (np/nf) 

time 

iter, (np/nf) 

time 

(1,1,1,1,1) 

lO-'^ 

26 (26/26) 

0.009 

14 (27/53) 

0.009 

28 (29/57) 

0.007 


10 -'> 

49 (49/49) 

0.017 

24 (47/93) 

0.015 

52 (53/105) 

0.014 

(0,0, 0,0,0) 

10 “^ 

15 (18/35) 

0.009 

32 (63/82) 

0.015 

43 (44/87) 

0.012 


lO-® 

34 (37/54) 

0.016 

42 (83/122) 

0.025 

67 (68/135) 

0.017 

random 

lO-'^’ 

25 (28/41) 

0.012 

79 (157/175) 

0.029 

90 (91/181) 

0.024 


10 -“ 

59 (59/59) 

0.02 

89 (179/215) 

0.050 

114 (115/229) 

0.036 

random 

10 “^ 

20 (24/44) 

0.015 

23 (45/67) 

0.011 

36 (37/73) 

0.017 


10 -“ 

43 (47/67) 

0.022 

33 (65/107) 

0.017 

60 (61/121) 

0.016 


Table 5.5 

Results for Alg. 4.2 and Alg. Solod.-Svait., and TBFM-nst for Problem 5 


for Problem 1 with large m EGM and SubEGM, even with their best stepsize, are 
slower. 

Second, our numerical experiments confirm that “bad” cases (4.i) and (4.ii) 
in Alg 4.2 are really very rare. Notice that the number of evaluation of F is ap¬ 
proximately the same as the number of iteration, that is a drastic contrast with 
Alg. Solod.Svait and TBFM-nst. Also if even we do not take into account the good 
performance of Alg. 4.2, it seems to be more robust than Alg. Solod.-Svait., since it 
does not require any additional assumptions on the feasible set and its stepsizes are 
separated from zero. On the other hand, the proof of Alg. Solod-Svait., in contrast 
to Alg. 4.2, does not require the Lipschitz-continuity of the mapping F. 

Acknowledgments, The author wishes to express his gratitude to Mikhail 
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